Corpus: koi_wikipedia_2016

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 1889 к-
2 1468 в-
3 1430 п-
4 1169 с-
5 1098 К-
Top Character Bigrams
word rank frequency n-gram
1 457 ве-
2 411 по-
3 410 ко-
4 363 кы-
5 355 ка-
Top Character Trigrams
word rank frequency n-gram
1 127 кар-
2 119 кыв-
3 111 кол-
4 110 кер-
5 97 веж-
Top Character 4-Grams
word rank frequency n-gram
1 71 видз-
2 65 весь-
3 51 лэдз-
4 51 аркм-
5 49 поса-
Top Character 5-Grams
word rank frequency n-gram
1 59 веськ-
2 48 посад-
3 42 сёрни-
4 40 район-
5 36 велöт-
633 msec needed at 2017-12-29 19:03